Distance-based versus Tree-based Key Recognition in Musical Audio
نویسندگان
چکیده
A new method for the recognition of the tonal center or key in a musical audio signal is presented. Time-varying key feature vectors of 264 synthesized sounds are extracted from an auditory-based pitch model and converted into character strings using PCA-analysis and classification trees. Results are compared with distance-based methods. Examples are given of the characteristics of the new tonality analysis tool. The potential of this method as a building stone in a music retrieval system is discussed.
منابع مشابه
Automatic Transcription of Audio Signals
This thesis is concerned with automatic transcription of monophonic audio signals into the MIDI representation. The transcription system incorporates two separate algorithms in order to extract the necessary musical information from the audio signal. The detection of the fundamental frequency is based on a pattern recognition method applied on the constant Q spectral transform. The onset detect...
متن کاملMusical Instrument Identification based on a Short-Time Spectral Analysis
In this paper, an implementation of an audio recognition system for personal computers is presented, combining methodologies of digital signal processing, machine perception, and statistical decision models. In particular, attention was given to musical tones, harmonics, and MIDI notes, that build the musical context to identify two musical instruments from their corresponding musical notes. Al...
متن کاملPredicting Key Recognition Difficulty in Music Using Statistical Learning Techniques
In this paper, the authors use statistical models to predict the difficulty of recognizing musical keys from polyphonic audio signals. The key recognition difficulty provides important background information when comparing the performance of audio key finding algorithms that often evaluated using different private data sets. Given an audio recording, represented as extracted acoustic features, ...
متن کاملBridging Printed Music and Audio Through Alignment Using a Mid-level Score Representation
We present a system that utilizes a mid-level score representation for aligning printed music to its audio rendition. The mid-level representation is designed to capture an approximation to the musical events present in the printed score. It consists of a template based note detection frontend that seeks to detect notes without regard to musical duration, accidentals or the key signature. The p...
متن کاملCombining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)
Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...
متن کامل